Extending WordNet using Generalized Automated Relationship Induction
نویسندگان
چکیده
This paper describes a Java package for automatically extending WordNet and other semantic lexicons. Extending these semantic lexicons by traditional means of hand labeling word relationships is a very expensive and laborious process. We used machine learning techniques to automatically extract relationships between words from a given text corpus. The package is made to be very flexible, allowing for various modules, such as new classifiers and semanitic lexions, to be “plugged-in.” The power of the package comes form its ability to seamlessly integrate the Stanford Parser to WordNet. Results obtained for various tests, particularly those done for a Naïve Bayes classifier, are promising.
منابع مشابه
Automatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
متن کاملUsing and Extending WordNet to Support Question- Answering
Over the last few years there has been increased research in automated question-answering from text, including questions whose answer is implied, rather than explicitly stated, in the text. WordNet has played a central role in many such systems (e.g., 21 of the 26 teams in the recent PASCAL RTE3 challenge used WordNet), and thus WordNet is being increasingly stretched to play more semantic task...
متن کاملUsing an Ontology for Improved Automated Content Scoring of Spontaneous Non-Native Speech
This paper presents an exploration into automated content scoring of non-native spontaneous speech using ontology-based information to enhance a vector space approach. We use content vector analysis as a baseline and evaluate the correlations between human rater proficiency scores and two cosine-similarity-based features, previously used in the context of automated essay scoring. We use two ont...
متن کاملAutomated Discovery of Telic Relations for WordNet
A method is presented for automatically extending WordNet with the telic relationships proposed in Pustejovsky’s lexicon model. The method extracts telic relationships from WordNet glosses by first selecting a telic word through a pattern matcher aided by a part-of-speech tagger and then employing a word disambiguation module to select the specific meaning (synset) of the telic word. The method...
متن کاملExtending HCONE-Merge by Approximating the Intended Meaning of Ontology Concepts Iteratively
A central aspect of HCONE-merge is the mapping of ontology concepts to a hidden intermediate ontology by uncovering the intended meaning of concepts. Such a mapping is realized by a semantic morphism from ontology concepts to WordNet senses. Extending methods that have already been proposed, this paper proposes an iterative algorithm for approximating the intended meanings of ontology concepts ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007